Organization and Programming of the Multistore Parser

Author

  • Pier Paolo Pisani
Abstract

... recognize and explain structural patterns in natural-language sentences (specifically English) and eventually yield an output in which the relations between the various items of the sentence are hierarchically displayed. The recognition of these structural patterns is made by means of a system of rules which operate on a sequence of words, i.e. a sentence, whose individual characteristics are pre-established. By individual characteristics are meant the possibilities a word has to correlate (i.e. to form a syntactic combination) with another item; these possibilities are represented by 'correlators', that is, by syntactic elements which link two items in a correlation.

Each word is characterized by a set of pre-established data:

a) the S-code, which distinguishes between the various senses of a homograph. For instance, a word like "READ": i) present tense; ii) past tense. These distinctions are essential, since whenever a homograph occurs, one and only one of its meanings can be taken into consideration to make the final pattern, unless, of course, the sentence is ambiguous and more than one final pattern is to be recognized, as in: I READ THE BOOK.

b) the sequence of correlational indices (Ic's), that is, the string of potential links that each word-sense has. Each Ic represents a possible syntactic connection between two items and is identified by: i) the code number of the relation it establishes between two items; ii) the 'type' of correlation.

There are six different types of correlation, which split into two groups: 'explicit' correlators and 'implicit' correlators. By 'explicit' correlator we mean a linking element which is represented by a linguistic item; prepositions and conjunctions are explicit correlators. By 'implicit' correlator we mean a relation between two items which is not expressed by any linguistic item but is indicated by the relative position of the two items (which we call their correlational function).
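The per-word data described in the abstract (an S-code for each sense of a homograph, plus a sequence of correlational indices, each carrying a relation code number and a correlation type) can be pictured as a small record structure. The following Python sketch is only an illustration under stated assumptions: the class and function names, the relation code numbers, the numbering of the six types as 1 to 6, and the sample lexicon entries are all hypothetical and are not taken from the original parser.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple

# The abstract states that there are six types of correlation.
# Numbering them 1..6 is an assumption made for this sketch.
NUM_CORRELATION_TYPES = 6


@dataclass(frozen=True)
class CorrelationalIndex:
    """One Ic: a potential link that a word-sense can enter into."""
    relation_code: int     # code number of the relation (sample values are invented)
    correlation_type: int  # one of the six types, numbered 1..6 here by assumption

    def __post_init__(self) -> None:
        if not 1 <= self.correlation_type <= NUM_CORRELATION_TYPES:
            raise ValueError("correlation_type must be between 1 and 6")


@dataclass(frozen=True)
class WordSense:
    """One sense of a word form, identified by its S-code."""
    form: str
    s_code: str                              # distinguishes the senses of a homograph
    indices: Tuple[CorrelationalIndex, ...]  # the string of potential links


# Hypothetical lexicon: "READ" is a homograph with a present-tense and a
# past-tense sense, as in the abstract. The Ic values are placeholders.
LEXICON: Dict[str, List[WordSense]] = {
    "READ": [
        WordSense("READ", "present", (CorrelationalIndex(101, 1),)),
        WordSense("READ", "past", (CorrelationalIndex(101, 1),
                                   CorrelationalIndex(205, 4))),
    ],
}


def senses_of(form: str) -> List[WordSense]:
    """Return every pre-established sense recorded for a word form."""
    return LEXICON.get(form.upper(), [])


if __name__ == "__main__":
    for sense in senses_of("read"):
        print(sense.s_code, [ic.relation_code for ic in sense.indices])
```

Keeping each sense as a separate record mirrors the requirement that only one meaning of a homograph enters a given final pattern, while an ambiguous sentence such as I READ THE BOOK can still yield one pattern per sense.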

Similar resources

Feature Engineering in Persian Dependency Parser

A dependency parser is one of the most important fundamental tools in natural language processing; it extracts the structure of sentences and determines the relations between words based on dependency grammar. Dependency parsing is well suited to free-word-order languages such as Persian. In this paper, a data-driven dependency parser has been developed with the help of a phrase-structure parser fo...

Studying impressive parameters on the performance of Persian probabilistic context free grammar parser

In linguistics, a tree bank is a parsed text corpus that annotates syntactic or semantic sentence structure. The exploitation of tree bank data has been important ever since the first large-scale tree bank, The Penn Treebank, was published. However, although originating in computational linguistics, the value of tree bank is becoming more widely appreciated in linguistics research as a whole. F...

Query processing in multistore systems: an overview

Building cloud data-intensive applications often requires using multiple data stores (NoSQL, HDFS, RDBMS, etc.), each optimised for one kind of data and tasks. However, the wide diversification of data store interfaces makes it difficult to access and integrate data from multiple data stores. This important problem has motivated the design of a new generation of systems, called multistore syste...

BILEVEL LINEAR PROGRAMMING WITH FUZZY PARAMETERS

Bilevel linear programming is a decision making problem with a two-level decentralized organization. The "leader" is in the upper level and the "follower" in the lower. Making a decision at one level affects that at the other one. In this paper, bilevel linear programming with inexact parameters has been studied and a method is...

Generating a Persian Phrase-Structure Treebank by Automatic Conversion

Treebanks are among the important and useful resources in natural language processing tasks. Dependency and phrase structures are two well-known kinds of treebanks. Many efforts have already been made to convert dependency structure to phrase structure. In this paper we study an approach to convert dependency structure to phrase structure because of the lack of a large phrase-structure treebank in Persian. A...

Publication date: 1969